NTT Data: Description of the Erie System Used for MUC-6
نویسندگان
چکیده
Erie is a name recognition system developed for the Multilingual Entity Task (MET) in MUC-6. The pattern matching engine recognizes organization, person, and place names along with time and numeric expressions in Japanese text. Although our previous information extraction system Textract performed well in MUC-5, the pattern matching engine, which was written in AWK language, was slow[2]. System maintenance was also difficult, since the patterns were defined in both the matching engine and the pattern files. Erie solves these problems by generating a pattern matching engine in C language directly from the defined patterns.
منابع مشابه
University of Durham: Description of the LOLITA system as Used in MUC-7
LOLITA has been designed in such a way that the code implementing the MUC tasks is only a small part of the whole system. A core system provides complex facilities with the MUC system being built so that it utilises these facilities. Hence, after some background to the LOLITA project, the ‘core’ of LOLITA is described. This system description is substantially similar to that given for MUC-6 [1]...
متن کاملCRL/NMSU: description of the CRL/NMSU systems used for MUC-6
CRL submitted two systems for the Named Entity task . One of these (Basic) is an improved version of the CRL name recognizer developed in phase one of Tipster[1]. The second (AutoLearn) is a system which learns automatically from training data . The Basic system had approximatel y six man months of work in its original development . Improvements for MUC-6 were carried out by one graduate studen...
متن کاملAmerican University in Cairo: Description of the American University in Cairo's System Used for MUC-7
Portions of the American University in Cairo's MUC-7 system, MUC7-Plink, have participated in every Message Understanding Competition since MUC-4. The Plink parser was developed at the University of Michigan where it formed the core of the systems entered in MUC-4 [2] and MUC-5 [1]. Recently, the Plink parser was added to GATE [6] to facilitate interaction between language processing modules. M...
متن کاملDescription of the UPENN CAMP System as Used for Coreference
Scoring the performance of a system is an extremely important aspect of coreference algorithm performance. The score for a particular run is the single strongest measure of how well the system is performing and it can strongly determine directions for further improvements. In this paper, we present several di erent scoring algorithms and detail their respective strengths and weaknesses for vary...
متن کاملUniversity of Sheffield: Description of the LaSIE-II System as Used for MUC-7
The University of She eld NLP group took part in MUC-7 using the LaSIE-II system, an evolution of the LaSIE (Large Scale Information Extraction) system rst created for participation in MUC-6 [9] and part of a larger research e ort into information extraction underway in our group. LaSIE-II was used to carry out all ve of the MUC-7 tasks and was, in fact, the only system to take part in all of t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1996